A Framework for Figurative Language Detection Based on Sense Differentiation
نویسنده
چکیده
Various text mining algorithms require the process of feature selection. High-level semantically rich features, such as figurative language uses, speech errors etc., are very promising for such problems as e.g. writing style detection, but automatic extraction of such features is a big challenge. In this paper, we propose a framework for figurative language use detection. This framework is based on the idea of sense differentiation. We describe two algorithms illustrating the mentioned idea. We show then how these algorithms work by applying them to Russian language data.
منابع مشابه
Linguistic-based Patterns for Figurative Language Processing: The Case of Humor Recognition and Irony Detection
Figurative language represents one of the most difficult tasks regarding natural language processing. Unlike literal language, figurative language takes advantage of linguistic devices such as irony, humor, sarcasm, metaphor, analogy, and so on, in order to communicate indirect meanings which, usually, are not interpretable by simply decoding syntactic or semantic information. Rather, figurativ...
متن کاملSentiment Analysis of Figurative Language using a Word Sense Disambiguation Approach
In this paper we propose a methodology for sentiment analysis of figurative language which applies Word Sense Disambiguation and, through an n-gram graph based method, assigns polarity to word senses. Polarity assigned to senses, combined with contextual valence shifters, is exploited for further assigning polarity to sentences, using Hidden Markov Models. Evaluation results using the corpus of...
متن کاملBrighter than Gold: Figurative Language in User Generated Comparisons
Comparisons are common linguistic devices used to indicate the likeness of two things. Often, this likeness is not meant in the literal sense—for example, “I slept like a log” does not imply that logs actually sleep. In this paper we propose a computational study of figurative comparisons, or similes. Our starting point is a new large dataset of comparisons extracted from product reviews and an...
متن کاملDesign and implementation of Persian spelling detection and correction system based on Semantic
Persian Language has a special feature (grapheme, homophone, and multi-shape clinging characters) in electronic devices. Furthermore, design and implementation of NLP tools for Persian are more challenging than other languages (e.g. English or German). Spelling tools are used widely for editing user texts like emails and text in editors. Also developing Persian tools will provide Persian progr...
متن کاملDetecting and Processing Figurative Language in Discourse
Figurative language poses a serious challenge to NLP systems. The use of idiomatic and metaphoric expressions is not only extremely widespread in natural language; many figurative expressions, in particular idioms, also behave idiosyncratically. These idiosyncrasies are not restricted to a non-compositional meaning but often also extend to syntactic properties, selectional preferences etc. To d...
متن کامل